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Observing and controlling complex networks are of paramount interest for un¬ 
derstanding complex physical, biological and technological systems. Recent 
studies have made important advances in identifying sensor or driver nodes, 
through which we can observe or control a complex system. Yet, the observa¬ 
tional uncertainty induced by measurement noise and the energy required for 
control continue to be significant challenges in practical applications. Here we 
show that the variability of control energy and observational uncertainty for 
different directions of the state space depend strongly on the number of driver 
nodes. In particular, we find that if all nodes are directly driven, control is ener¬ 
getically feasible, as the maximum energy increases sublinearly with the system 
size. If, however, we aim to control a system through a single node, control in 
some directions is energetically prohibitive, increasing exponentially with the 
system size. For the cases in between, the maximum energy decays exponen¬ 
tially when the number of driver nodes increases. We validate our findings in 
several model and real networks, arriving to a series of fundamental laws to 
describe the control energy that together deepen our understanding of complex 
systems. 


Many natural and man-made systems can be represented as networks[IH3], where nodes 
are the system’s components and links describe the interactions between them. Thanks 
to these interactions, perturbations of one node can alter the states of the other nodes[D- 
E]. This property has been exploited to control a network, i.e. to move it from an initial 
state to a desired final state by manipulating the state variables of only a subset of its 
nodes [TO]. Such control processes [TQl - f26] play an important role in the regulation of protein 
expression[27], the coordination of moving robots[28]. and the inhibition of undesirable social 
contagions[2B]. At the same time the interdependence between nodes means that the states 
of a small number of sensor nodes contain sufficient information about the rest of the network, 
so that we can reconstruct the system’s full internal state by accessing only a few outputs[ L 30j. 
This can be utilized for biomarker design in cellular networks, or to monitor in real time 
the state and functionality of infrastructural [3Tj and social-ecological [32] systems for early 
warning of failures or disasters [33]. 
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While recent advances in driver and sensor node identification constitute unavoidable 
steps towards controlling and observing real networks, in practice we continue to face signif¬ 
icant challenges: the control of a large network may require a vast amount of energy [T51 - H7] 
and measurement noise [3l] causes uncertainties in the observation process. To quantify these 
issues we formalize the dynamics of a controlled network with N nodes and N d external 
control inputs as [Tj ITOl 

x(f) = Ax(t) + Bu(t), (1) 

where the vector x(t) = [x±(t), x 2 (t ),..., £jv(t)] T describes the states of the N nodes at time 
t and Xi(t) can represent the concentration of a metabolite in a metabolic network[35]. the 
geometric state of a chromosome in a chromosomal interaction network [13], or the belief 
of an individual in opinion dynamics[291 T6i . The vector u (t) = [ui(t), u 2 (t ),... ,MAr D (t)] T 
represents the external control inputs, and B is the input matrix with B t j = 1 if control 
input Uj(t) is imposed on node i. The adjacency matrix A captures the interactions between 
the nodes, including the possibility of self-loops An representing the self-regulation of node 
i. 

Control energy 

The system (1) can be driven from an initial state x G to any desired final state x^ within the 
time t G [0,r] using an infinite number of possible control inputs u(t). The optimal input 
vector aims to minimize the control energy[7] J Q r ||u(f)|| 2 df, which captures the energy of 
electronic and mechanic systems or the amount of effort required to control biological and 
social systems. If at t — 0 the system is in state x G = 0, the minimum energy required to 
move the system to point x^ in the state space can be shown to be|Zl!I5HEZl 

^( T ) = x dG~ 1 (r)x d , ( 2 ) 

where G c (r ) = J Q r e At BB T e ATt dt is the symmetric controllability Gramian. When the system 
is controllable all eigenvalues of G c (t) are positive. Eq. (2) indicates that for a network 
A and an input matrix B the control energy £{r) also depends on the desired state x^. 
Consequently, driving a network to various directions in the state space requires different 
amounts of energy. For example, to move the weighted network of Fig. la to the three 
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different final states with ||xd|| = 1, we inject the optimal signals u(£) shown in Fig. lb 
onto node 1, steering the system along the trajectories shown in Fig. lc. The corresponding 
minimum energies are shown in Fig. Id. The control energy surface for all normalized 
desired states is an ellipsoid, implying that the required energy varies dramatically as we 
move the system in different directions. 

As real systems normally function near a stable state, i.e. all eigenvalues of A are 
negative[37]. the control energy £(r) decays quickly to a nonzero stationary value when 
the control time r increases [15]. Henceforth we focus on the control energy £ = £{r —> oo) 
and the controllability Gramian G = G c (r — y oo). 

Given a network A and an input matrix B, the controllability Gramian G is unique, 
embodying all properties related to the control of the system. To uncover the directions 
of the state space requiring different energies, we explore the eigen-space of G. Denote 
by £i the eigen-energies, i.e. the minimum energy required to drive the network to G' s 
eigen-directions. According to Eq. ([2]) £j = 1/Hi with ji t corresponding to G°s eigenvalues. 
Generally, the energy surface for a network with N nodes is a super-ellipsoid spanned by 
G’s N eigen-energies. To determine the distribution of these eigen-energies we decompose 
the adjacency matrix as A = HAH T , where V represents the eigenvectors of A and A = 
diag{—Ai, — A 2 , • • •, — Xn} are the eigenvalues. For stable undirected networks all eigenvalues 
of A are negative, thus we denote the eigenvalues by —Aj so that the absolute eigenvalues are 
A* > 0 for all i. We sort the absolute eigenvalues in ascending order 0 < Ai < A 2 < ... < Xn, 
finding that (SI Sec. I) 


G = V[{V t BB t V)oC}V t , 


(3) 


where o denotes Hadamard product, i.e. (.X o Y)^ = X t jY lv and C is a matrix with entries 
For a given network, (3) captures the impact of the input matrix B on the 


G — 1 

Ai+A i 


control properties of the system, allowing us to analyze the distribution of eigen-energies for 
different number of driver nodes and determine the required energy for each direction. 


Controlling a system through all nodes 

If we can control all nodes, i.e. No = N, B becomes a unit diagonal matrix. In this case G = 
Hdiag{;^-}I/ T and the eigen-directions of the controlled system are the same as the network’s 
eigenvectors. Thus £■; = and p(£) = (l/2)p(A), i.e. the distribution of eigen-energies is 


4 





proportional to the distribution of the network’s absolute eigenvalues. We add self-loops as 
An = — {S + YlJ=i Aij) where 5 > 0 is a small perturbation to ensure that all eigenvalues of A 
are negative. This scheme has been widely used in previous studies on dynamical processes 
taking place on networks, such as opinion dynamics[29], synchronization (38], and control |15| . 
For networks with degree distribution [TH3] p(k) ~ A; -7 the distribution of A’s absolute 
eigenvalues also obeys a power law|39, 00] p( A) ~ A -7 (see SI Sec. II A). Consequently, 

p(£) ~ £~ 7 , (4) 

indicating that the system can be easily driven in most directions of the state space, requiring 
a small £. A few directions require considerable energy and the most difficult direction 
needs [Hj £ max r\j N i- 1 . The fact that £ max sub-linear in iV for 7 > 2 indicates that, when 
N d = N, the energy density £/iV D remains bounded. I 11 Fig. 2 we test the prediction Q for 
several model, infrastructural, social, and biological networks. We fold that p{£) follows a 
power law for uncorrelated or correlated scale-free model networks (Figs. 2a-b) , the airline 
transportation network (Fig. 2d), the Internet AS-level network (Fig. 2e), an Israeli social 
network (Fig. 2g), the user-interaction network of an online forum (Fig. 2h), the human 
protein-protein interaction network (Fig. 2j), and the human heterogeneous network (Fig. 
21) in line with the prediction Q. In contrast, for several networks with bounded degree 
distribution, like the Erdos-Renyi random network (Fig. 2c), the US power grid network 
(Fig. 2f), the interlocking network of Norwegian companies (Fig. 2i), and the functional 
coactivation network of the human brain (Fig. 21), p(£) is also bounded, as predicted by 
7 —> 00 in Q. Such networks require even less energy for controlling their progress in 
their most difficult direction. Taken together, we find that for N d = N the distribution of 
eigen-energies is uniquely determined by network topology and we lack significant energetic 
barriers for control. 

Controlling a system through a single node 

If all nodes exhibit nonidentical self-loops we can control an undirected network by driving 
only a single node pH Wl \. In this case V r BB T V = {V ih Vj h } ~ 0(l/N), where h is the 
index of the chosen driver node. Thus V T BB T V can be viewed as a small perturbation to 
the matrix C in (|3]). The statistical behavior of G’s eigenvalues is mainly determined by the 
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eigenvalues of C, which can be approximated by Cholesky factors[43]. As mentioned above, 
for networks with p(k) ~ /c -7 , the distribution of A’s absolute eigenvalues also follows a 
power law, providing the i-th eigenvalue \ ~ ( N ^ r 1 _ i ) 7-1 (SI Sec. IIB). If 7 —> 0, i.e. for 
extremely heterogeneous networks [IT, ITT] . the eigenvalue gaps g t = Aj + i — A* are identical. 
For 7 —> oo (homogeneous networks), g t is again uniform. Hence it is reasonable to assume 
Qi = Q for all i, allowing us to analytically obtain the distribution of eigen-energies as 
p(S) ~ 1/(1 + 1 /S)S~ 1 (see SI Sec. Ill A, B). Therefore, 

p{£) ~ S~ x (5) 

for large £. Equation (5) predicts that, to drive a stable network of N nodes with a sin¬ 
gle driver node, the most difficult direction in the state space requires £ max ~ e N energy 
(SI Sec. Ill C). This exponential ^-dependence makes the control of large networks in 
the most difficult direction energetically infeasible. For validation we also consider the 

we obtain 

p>(£) (ln£ max — hi £), decreasing linearly with ln£. We test our prediction on several 
network models (Figs. 3a-c) and real networks (Figs. 3d-l), finding that the corresponding 
eigen-energies span over a hundred orders of magnitude and this exceptional range of vari¬ 
ations are reasonably well approximated by ([5]) for both p>(£) and p(£). Taken together, 
if we attempt to control a network from a single node (Ab = 1 ), the required energy varies 
enormously for different directions, almost independently of the network structure, making 
some directions prohibitively expensive energetically. 

Controlling a system through a finite fraction of its nodes 

When p(£) ~ £ -7 , the distribution p{£) ~ e^ 1-7 ^ where £ = \n£. Thus, if N d = N, p(£) 
is an exponential (one-peak) distribution for 7 > 2 in (|4j); if A^d = 1 , as p(£) £ 1 in 0 , 

p{£) is a uniform distribution. To understand the transition from Q for A"d = AT to ([HJ) 
for Ab = 1, we investigate the distribution p{£) when 1 < N d < N, i.e. when we try to 
control a system through a finite fraction of its nodes. In this case we find that p(£) has 
multiple peaks (Fig. 4a), which are induced by gaps in the eigen-energy spectrum (Fig. 4b). 
For Nv/N = 0.6, a gap separates the eigen-energies into two bands, such that the lower 
band contains Affi eigen-energies. This gap leads to two peaks in the distribution p(£) as 


complementary cumulative distribution p >(£) = / £ max p(£')d£'. Based on ( 
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shown in Fig. 4c. When we have fewer driver nodes (Nd/N decreases), the number of peaks 
iVpeak increases (Fig. 4a). We find that iV peak = intfiV/iVo], predicting A^ pea k = 2,4,5 for 
N-d/N = 0.5, 0.25, 0.2, respectively (see also SI Fig. S3). The multi-peak nature of p(£) has 
two important implications. First, the boundary of the first energy band £n 0 varies only 
weakly with N& (Fig. 4d), indicating that the energy required to move the network within 
the subspace spanned by the first N& eigen-directions is relatively small. Second, £ (i.e. 
log£) grows linearly from one band to the next (SI Fig. S4). Thus, log£ max (the boundary 
of the last band) is linearly dependent on the number of peaks, i.e. £ max r\j e N/N ° (Fig. 
4d). Controlling a single node induces N peaks in p(£), consequently the distribution p(£) 
becomes uniform (SI Fig. S3), resulting in p{8) r^j £ 1 of ({b}) and £ max rs_/ e N . We numerically 
test the prediction £ max ~ e N ^ N ° for several real networks (Figs. 4e-j), the result being in 
excellent agreement with our prediction. 

In Table 1 we summarize our findings about the distribution of eigen-energies and the 
maximum energy required to control a system towards the most difficult direction. 

Implications to observational uncertainty 

The results obtained above have direct implications for observability as well. Indeed, consider 
a system governed by the dynamics 


x(f) = Ax.(t) (6) 

y(f) = Cx(t) + w(t) (7) 

with an initial state x G ^ 0, where C is the output matrix and y (t) are the output signals 
including measurement noise w(£), which we assume to be a Gaussian white noise with zero 
mean and variance one. We aim to estimate x 0 of the initial state x Q while minimizing the dif¬ 
ference / Q r ||y(£) — y(t)\\ 2 dt between the output y (t) that is actually observed and the output 
y (t) = Ce At it 0 that would be observed in the absence of noise. With the maximum-likelihood 
approximation[45]. the expectation (x G ) = x D and the covariance matrix[45] (xx 1 ) = G~ l {r ), 
where x = x Q — x 0 is estimation error and G 0 (t) = J Q r e ATt C T Ce At dt is the observability 
Gramian. Therefore, the variance a 2 of the approximation in direction x is 

a 2 (r) =k T G4 1 (r)k, (8) 
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indicating that the estimation uncertainty varies with the direction of the state space. To 
illustrate this, consider the network in Fig. le that moves along the trajectory of Fig. lg, 
while we measure the state of the sensor node and plot the noisy output y (t) in Fig. If. 
With the maximum-likelihood approximation we reconstruct x a from y(t) and show the 
estimation error x = x Q — x G for thousands of independent runs (Fig. lh). The estimation 
variance is different for various directions, forming an uncertainty ellipsoid. Thanks to the 
duality between G c (r) and G 0 (t), the control energy for a direction in Fig. Id represents 
the estimation variance for the same direction in Fig. lh. 

To be specific, due to the duality of the controllability Gramian G c in ([2]) and the ob¬ 
servability Gramian G 0 in (|8]), we have cr 2 = £ for the same direction, implying that the 
least controllable direction (i.e. the direction requiring the most energy) is also least ob¬ 
servable (having highest uncertainty). Therefore, our findings about the distribution of 
eigen-energies apply directly to the distribution of a 2 along the eigen-directions: if all nodes 
are sensor nodes (IVg = N ) we have p(a 2 ) ~ (cr 2 ) -7 ; if we attempt to observe the system 
from a single node (IVg = 1) we have p(cr 2 ) ~ (cr 2 ) -1 ; and for a finite fraction of sensor nodes 
(1 < iVs < N) the largest observational uncertainty cr ^ aY decreases exponentially when the 
number of sensor nodes increases, i.e. cr 2 iax ~ e N ^ Ns . 

Beyond the degree distribution 

Real networks have a number of additional properties that are not encoded by their degree 
distributions, like local clustering[36], degree correlations[4?J, and community structure|48] 
(SI Table SI). To assess the impact of these topological characteristics we perform the 
degree-preserved randomization[49j on each network, eliminating local clustering, degree 
correlations and modularity. We find that the distribution of eigen-energies required to 
drive each randomized network follows the predictions (4) and (5) (Figs. S6 and S7 in SI), 
indicating that degree distribution is the main factor determining p(S). When the number 
of driver nodes increases, the maximal control energy for the randomized networks decreases 
exponentially, as predicted earlier (Fig. S8 in SI). We also validate the predictions (4) and 
(5) on model networks with positive or negative degree correlations (Figs, lb and 2b). 
All the tests indicate that the strength of local clustering, degree correlations or commu¬ 
nity structure have only minor influence on the behavior of control energy. Consequently, 
our calculations for uncorrelated networks capture the correct fundamental dependence of 


control energy for real networks. 

Many real networks have dead ends, i.e. nodes with one degree, which can undermine 
the stability of complex systems[50]. To test the impact of dead ends on control energy we 
explored several real networks that contain a considerable number of one-degree nodes (see 
SI Table SI). As shown in Figs. 2-4 and S6-S8 the predictions are robust against such dead 
ends. 

Conclusion and discussion 

The energy required for control is a significant issue for practical control of complex systems. 
By exploring the eigen-space of controlled systems we found that if all nodes of a system 
are directly driven, the eigen-energies can be heterogeneous or homogeneous, depending on 
the structure of underlying networks. Yet, if we wish to control a system through a single 
node, the eigen-energies are enormously heterogeneous, almost independently of the network 
structure. Finally, if a finite fraction of nodes are driven, the maximum control energy 
decays exponentially with the increasing number of driver nodes. Taken together, our results 
indicate that even if controllable, most systems still have directions which are energetically 
inaccessible, suggesting a natural mechanism to avoid undesirable states. Indeed, many 
complex systems, such as transcriptional networks for gene expression[51] and sensorimotor 
systems for motion control[52], only need to function in a low-dimensional subspace. Due to 
the duality of controllability and observability, our results also imply that, if we monitor only 
a small fraction of nodes, the observation can be extremely unreliable in certain directions 
of the phase space. 

It is worth noting that linear dynamics captures the behavior of nonlinear systems in 
the vicinity of their equilibria. The formalism (1) has been widely used to model diverse 
networked systems [1511171 23 , 221E5J (see also SI Sec. VII A, B), allowing us to reveal the role 
of the network topology on the fundamental control properties of complex systems]!]]], ITU 
M\- Indeed, if the linearized system (1) is controllable, the original nonlinear system is 
locally controllable[53]. The corresponding control energy is also highly heterogeneous for 
different directions, if we constrain the system’s trajectory to be local (SI Sec. VII C). 
Moreover, if the linearized dynamics of a nonlinear system is controllable along a specific 
trajectory, the original nonlinear system is locally controllable along the same trajectory[53]. 
This implies that our results can be potentially extended to describe control properties of 
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nonlinear systems in the vicinity of their stability basin[50l [53]. Yet, in this case, the 
linearized dynamics becomes time-varying, and the required energy for controlling time- 
varying systems remains an open problem that deserves future attention. 
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Controlling 


Observing 




control energy observational uncertainty 


FIG. 1: Controlling and observing a network, a, The control of a three-node weighted network 
with one external signal u(t) that is injected to the red driver node. Hence the input matrix is 
B = [1,0,0] T . The nodes have negative self-loops which make all eigenvalues of the adjacency 
matrix A negative, b, Optimal control signals which minimize the energies required to move the 
network from the initial state x D = [0,0,0] T to three different desired states with ||xd|| = 1 in 
the given time interval t E [0,3]. c, The trajectories of the network state x(t) driven respectively 
by the control signals in b. d, The control energy surface, showing the amount of energy required 
to move the network by one unit distance (i.e. ||x c ;|| = 1) in different directions. The surface is 
an ellipsoid spanned by the eigen-energies for the controllability Gramian’s three eigen-directions 
(arrows). The squares correspond to the three final points used in b and c. e, Observing the 
network with one output y(t). Node 1 is selected as the sensor (green), thus the output matrix 
is C = [1,0,0]. The measurement noise w(t) is assumed as Gaussian white noise with zero mean 
and variance one. f, A typical output y(t ) that is used to approximate the initial state x D . g, 
A typical trajectory of the system state x(f). h, Estimation error x = x D — x D , where x D is the 
maximum-likelihood estimator of the initial state. Starting from the same initial state we ran the 
system 5,000 times independently, each dot representing the estimation error of one run. The 
uncertainty ellipsoid (black) corresponds to the standard deviation of x in any direction. 
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FIG. 2: Controlling a network through all nodes ( N d = N). The panels show the distribution p(£) 
of eigen-energies required to control several model and real systems: a, a scale-free model network 
without degree-degree correlation; b, scale-free model networks with positive (r = 0.25) or negative 
(r = —0.24) degree-degree correlation; c, an Erdos-Renyi model network; d, the airline transporta¬ 
tion network; e, the Internet AS-level network; f, the US power grid network; g, an Israeli social 
network; h the user-interaction network of an online forum; i, the interlocking network of Nor¬ 
wegian companies; j, the human protein-protein interaction network; k, the human heterogeneous 
network; and 1, the functional coactivation network of the human brain. The straight lines show 
prediction © and the error bars represent standard deviations. For model networks the edges’ 
weights Aij are uniformly drawn from [0,1]. The self-loops An = — ^ ~2j Aij — 5 where 5 = 0.25, 
representing a small perturbation to diagonal entries, to ensure that the network is stable. The 
data sources and basic characteristics of these networks are discussed in SI Sec. VI A. 
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FIG. 3: Controlling a network through a single node (Ad = 1). The panels show the comple¬ 
mentary cumulative distribution p>(£) of eigen-energies required to control several model and real 
systems: a, a scale-free model network without degree-degree correlation; b, scale-free model net¬ 
works with positive (r = 0.34) or negative (r = —0.36) degree-degree correlation; c, an Erdos-Renyi 
model network; d, a sampled airline network; e, an Internet AS-level network in 1997; f, an IEEE 
power grid test network; g, the human-contact network of the ACM Hypertext 2009 conference; h, 
an email interaction network; i, the phone call network between different countries; j, a connected 
component of the human gene-coexpression network; k, a mutualism ecological network in Mauri¬ 
tius; and 1, the inter-region network of cat cortex. The insets are the log-log plots of probability 
distributions p(£) with logarithmic binning[3j. The straight lines show prediction ([5]) and the error 
bars represent standard deviations. The data sources and basic characteristics of these networks 
are discussed in SI Sec. VI A. 
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FIG. 4: Controlling a network through a finite fraction of its nodes, a, The multi-peak distribu¬ 
tions p[log(£)] for 1 < IVd < N, where the dots represent numerical results. The solid curves and 
shaded areas are smoothed for illustration. £at d denotes the boundary of the first energy band that 
contains IVd eigen-energies. £ max is the maximum control energy corresponding to the most difficult 
direction. The number of peaks is -/Vp ea k = int[IV/IVD] (see also SI Fig. S3), b, The eigen-energy 
spectrum for controlling the network. There is a gap in the logarithmic scale between the lYo-th 
and the (IVd + l)-th smallest eigen-energies, which leads to the two-peak distribution p[log(£)] 
in c. d, log(£ max ) and log(£Ar D ) as functions of N/Nd, indicating that £ max ~ e N / N v while £n d 
depends weakly on IVd- We also test the prediction in real networks: e, the North European power 
grid network; f, the interlocking network of Norwegian companies; g, the functional coactivation 
network of the human brain; h, an Internet AS-level network in Dec 1998; i, the user-interaction 
network of an online forum; and j, the metabolic network of C. elegans. The straight lines show 
the prediction £ max ~ e N ^ N ° and the error bars represent standard deviations. The data sources 
and basic characteristics of these networks are discussed in SI Sec. VI A. 

17 











































Table 1: Controlling complex networks with different number of driver nodes 


Number of driver nodes 

Distribution of eigen-energies 

Maximum control energy 

N D = N 

p(£) ~ £~ 7 

£max ~ 1 

1 < Nd < N 

A^peak = int[iV/iV D ] for p[log(£)] 

s ~ e N / N ° 

°max ° 

n d = 1 

p(£) ~ S- 1 

c N 

^max ^ ^ 


N is the total number of nodes and N d is the number of driver nodes. 7 is the exponent of 
the degree distributions p(k) ~ A; -7 . For large 7 the network becomes degree-homogeneous, 
behaving similarly to a random network. 
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